Transformation Rule Discovery through Data Mining

Authors

  • Holger Kache
  • Yannick Saillet
  • Mary Roth
Abstract

Data-intensive software programs typically transform a set of source data values into target data values. While the group of developers who write, compile and test the software or the query have a clear understanding of the transformation logic (or transformation rules) at the time the program is created, that understanding can quickly fade at an enterprise level for a variety of reasons, including poor documentation, loss of the source (uncompiled) version of the software, loss of the developers who wrote the software, or lack of available skills in the programming language (e.g., COBOL). This leaves the enterprise in a precarious position of not being able to maintain, upgrade or migrate the software programs at the heart of their operations unless they can recreate the transformations that relate the source data to the target data. In this paper, we propose a technique to reverse engineer transformation rules by analyzing the data using data mining algorithms and processing the results of these algorithms. We demonstrate our technique using a prototype implementation and prove its correctness with a sample data set.
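The abstract does not say which data mining algorithms the authors use, so the following is only a minimal sketch of the general idea, assuming the simplest case: a single source column mapped to a single target column by a lookup rule. It counts co-occurring value pairs and keeps the mappings whose support and confidence clear a threshold. The function name `mine_value_mappings`, the column names, the thresholds, and the sample rows are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def mine_value_mappings(rows, source_col, target_col,
                        min_confidence=0.9, min_support=2):
    """Infer candidate lookup rules source_value -> target_value.

    Hypothetical helper, not the authors' algorithm: a rule is kept when
    the most frequent target value for a source value occurs at least
    `min_support` times and in at least `min_confidence` of its rows.
    """
    # Count how often each target value co-occurs with each source value.
    pair_counts = defaultdict(Counter)
    for row in rows:
        pair_counts[row[source_col]][row[target_col]] += 1

    rules = {}
    for src, targets in pair_counts.items():
        tgt, hits = targets.most_common(1)[0]
        confidence = hits / sum(targets.values())
        if hits >= min_support and confidence >= min_confidence:
            rules[src] = (tgt, confidence)
    return rules


# Made-up sample of paired source/target records.
rows = [
    {"gender_code": "M", "gender": "Male"},
    {"gender_code": "M", "gender": "Male"},
    {"gender_code": "F", "gender": "Female"},
    {"gender_code": "F", "gender": "Female"},
    {"gender_code": "F", "gender": "Female"},
    {"gender_code": "U", "gender": "Male"},
    {"gender_code": "U", "gender": "Female"},
]

rules = mine_value_mappings(rows, "gender_code", "gender")
for src, (tgt, conf) in sorted(rules.items()):
    print(f"{src} -> {tgt} (confidence {conf:.2f})")
```

In this sample the ambiguous code `U` yields no rule because neither candidate target reaches the support threshold. A real system would also need to handle multi-column, arithmetic, and conditional transformations, which this sketch does not attempt.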

Similar resources

Application of Rough Set Theory in Data Mining for Decision Support Systems (DSSs)

Decision support systems (DSSs) are prevalent information systems for decision making in many competitive business environments. In a DSS, the decision-making process is closely tied to factors that determine the quality of information systems and their related products. Traditional approaches to data analysis usually cannot be implemented in sophisticated companies, where managers ne...

FUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING

The concept of intelligently controlling the search process of gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). At first a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...

Bridging Data Mining Model to the Automated Knowledge Base of Biomedical Informatics

The process of data mining comprises seven major steps: (1) data integration, (2) data transformation, (3) data cleaning, (4) data selection, (5) pattern extraction or knowledge mining, (6) pattern evaluation, and (7) knowledge presentation. Steps 1 to 4 are pre-data mining, whereas steps 6 and 7 may be viewed as post-data mining. Therefore, the seven major steps can be grouped into pre-data...

A 2D-3D visualization support for human-centered rule mining

On account of the enormous amounts of rules that can be produced by data mining algorithms, knowledge post-processing is a difficult stage in an association rule discovery process. In order to find relevant knowledge, the user needs to rummage through the rules. To make this task easier, we propose a new interactive mining methodology based on well-adapted dynamic visual representations. It all...

Expert Discovery: A web mining approach

Expert discovery is the search for an answer to a question: “Who is the best expert of a specific subject in a particular domain within peculiar array of parameters?” An expert with domain knowledge in any field is crucial for consulting in industry, academia and the scientific community. The aim of this study is to address the issues of the expert-finding task in real-world communities. Collabor...

Publication date: 2008